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Abstract — A space-time adaptive decision feedback (DF) 
receiver using recurrent neural networks (RNN) is proposed 
for joint equalization and interference suppression in direct- 
sequence code-division-multiple-access (DS-CDMA) systems 
equipped with antenna arrays. The proposed receiver structure 
employs dynamically driven RNNs in the feedforward section 
for equalization and multi-access interference suppression and 
a finite impulse response (FIR) linear filter in the feedback 
section for performing interference cancellation. A data 
selective gradient algorithm, based upon the set-membership 
design framework, is proposed for the estimation of the 
coefficients of RNN structures and is applied to the estimation 
of the parameters of the proposed neural receiver structure. 
Simulation results show that the proposed techniques achieve 
significant performance gains over existing schemes. 

Index Terms — DS-CDMA, multiuser detection, neural net- 
works, space-time processing, adaptive receivers, set-membership 
techniques. 



I. Introduction 

CODE division multiple access (CDMA) implemented 
with direct sequence (DS) spread-spectrum signalling 
is currently deployed in many wireless communication sys- 
tems. Such services include third-generation (3G) cellular 
telephony, indoor wireless networks, terrestrial and satellite 
communication systems. The advantages of CDMA include 
good performance in multi-path channels, flexibility in the 
allocation of channels, increased capacity in bursty and fading 
environments and the ability to share bandwidth with narrow- 
band communication systems without deterioration of either's 
systems performance |fl~), J2). 

The capacity of a DS-CDMA network is limited by dif- 
ferent types of interference, namely, narrowband interference 
(NBI), multi-access interference (MAI), inter-symbol interfer- 
ence (ISI) and the noise at the receiver. The major source of 
interference in most CDMA systems is MAI, which is highly 
structured and arises due to the fact that users communicate 
through the same physical channel with non-orthogonal sig- 
nals. The multiple propagation paths of the channel can destroy 
the orthogonality of the signals at the receiver, giving rise 
to MAI even when the users employ orthogonal signatures. 
For non-orthogonal signatures the channel and the lack of 
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synchronism between users can further increase the level of 
MAI. The conventional (single-user) receiver that employs a 
filter matched to the signature sequence does not suppress 
MAI and is very sensitive to differences in power between the 
received signals (near-far problem). In order to mitigate the 
effect of interference in CDMA systems, a designer can resort 
to multiuser detection techniques for MAI suppression, and 
antenna arrays, which perform spatial filtering and can reduce 
the impact of interference. The optimal multiuser detector of 
Verdu [3| suffers from exponential complexity and requires 
the knowledge of timing, amplitude and signature sequences. 
This has motivated the development of various sub-optimal 
strategies: the linear [4] and decision feedback [5| receivers, 
the successive interference canceller [6| and the multistage 
detector [7|. The combination of multiuser detection and 
beamforming can provide an enhanced performance for MAI 
and ISI suppression JT], 0. This requires the joint processing 
of the data received at an antenna array with elements closely 
spaced and the estimation of the direction of arrival (DoA) 
and the channel information. 

In current third-generation (3G) wireless cellular communi- 
cations, there are some specific features in the operation of the 
systems such as the use of antenna arrays by the base stations 
for increasing the capacity in the uplink (mobile terminals to 
base station) and different data rates and quality of service 
requirements JH]. In particular, third-generation wideband DS- 
CDMA systems have multi-code capabilities, in which the user 
signals with high data rate (HD) can be accommodated by 
reducing the processing gain Nh and using a low spreading 
factor and higher power levels [8 |. In these scenarios, users are 
equipped with multiple processing gains, operate at different 
data rates, and HD users operate at higher signal-to-noise 
ratios (SNRs). The SNR is proportional to their data rates, 
which leads to a situation where they experience a relatively 
low level of MAI but more significant level of ISI. In contrast, 
low data rate (LD) users, that work with larger spreading 
factors and lower SNRs, encounter higher levels of MAI and 
near-far effects. 

In order to design an effective space-time receiver for in- 
terference mitigation in DS-CDMA systems employing multi- 
code capabilities and antenna arrays, we investigate in this 
work the use of receiver structures based on recurrent neural 
networks and decision feedback schemes. One motivation for 
this work is that neural networks provide good nonlinear 
mapping of the inverse model of the channel and can deal with 
the uncertainty and interference present in the received data. 
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Therefore, receivers based on neural networks outperform 
those based on FIR linear filters. Despite the increased com- 
plexity over receivers with FIR linear filters, the deployment 
of neural structures is feasible in situations where the number 
of processing elements is not very large, i.e., the spreading 
factor of CDMA systems is low and the number of HD 
users is small. Such situations appear in the context of 3G 
wireless communications |8|. Recent work on equalization 
and modelling of nonlinear systems has shown that recurrent 
neural networks (RNNs) are superior in bit error rate (BER) 
performance to multilayer perceptrons (MLPs), radial basis 
functions (RBFs) J5) and FIR linear filters. Prior work on 
equalization structures also demonstrated that decision feed- 
back (DF) neural equalization structures are better than neural 
schemes without DF and DF and linear equalizers with linear 
FIR filters [10|. In the context of CDMA systems, existing 
works on neural and related structures are restricted to MLPs 
ifTTl . RBFs Q2), support vector machines (SVMs) Q2) and to 
static channels, and little attention has been given to RNNs 
and DF structures. This is especially relevant for the uplink 
of wireless communications systems, where the base station 
can afford an antenna array and more complex algorithms as 
compared to the mobile terminals. In this case, the trade-off 
between computational complexity and superior performance 
is quite attractive. 

In order to estimate the parameters of the receiver, a 
designer must resort to a training algorithm. Amongst the 
available training methods for neural detectors, one can em- 
ploy gradient based approaches such as the back propagation 
algorithm for MLPs [9| and the real time recurrent learning 
(RTRL) or fast least-squares (LS) and Kalman-based tech- 
niques 1 14 1 for RNNs. On one hand, gradient-type approaches 
for neural schemes are simpler to implement, exhibit low 
complexity but possess slow convergence rates and have to 
deal with the problem of vanishing gradients [9| for RNNs. 
On the other hand, LS and Kalman algorithms have fast 
convergence but exhibit very high computational complexity 
and numerical problems for implementation. This motivates 
the search for novel approaches to training RNNs that possess 
fast convergence, low complexity and flexibility. The algorithm 
can be also applied to problems such as estimation and 
tracking of non-linear and dynamic processes and modelling 
of time-varying non-linear systems (9). In this context, set- 
membership (SM) filtering [15|, [16| represents a class of 
recursive estimation algorithms that, on the basis of a pre- 
determined error bound, seeks a set of parameters that yield 
bounded output errors. In particular, the SM approach leads 
to low-complexity gradient algorithms with excellent conver- 
gence and tracking performance due to the use of an adaptive 
step size and data selective updates. The appealing feature of 
SM algorithms is that they are able to significantly improve 
the convergence and tracking of estimators and possess a 
complexity lower than conventional estimation algorithms due 
to their sparse updates. The existing works on SM algorithms 
deal with the adaptation of linear FIR filters and there is no 
SM algorithm for training important neural structures such as 
RNNs available. 

In this work, our first contribution is to present an adaptive 



space-time adaptive DF multiuser receiver structure for HD 
users with antenna arrays, using dynamically driven RNNs 
J9) in the feedforward section and FIR linear filters in the 
feedback section for cancelling the associated HD and LD 
users. These RNNs have not been considered for the problem 
of space-time multiuser detection so far. The advantage of 
antenna arrays lies in the spatial separation of the interfering 
users, increasing the capacity of the system. The proposed 
structure employs antenna arrays combined with a neural RNN 
processor and a DF structure, which are shown to provide 
considerable capacity gains over schemes with FIR linear 
filters. The second contribution of this work is the development 
of a novel data-selective algorithm for parameter estimation in 
RNNs using the set-membership estimation framework [15], 
[16|. The proposed algorithm is applied here to the problem 
of space-time neural receivers for DS-CDMA systems, even 
though, the algorithm is general and can be used for other 
applications of neural networks. It should also be remarked 
that some very recent contributions |[T7l - ll2"4"l . which constitute 
the state-of-the-art in the area of RNN, can be considered for 
further investigation in the context of interference mitigation 
in communication systems such as CDMA, OFDM and MIMO 
[25|. However, we have opted to focus here on a RNN 
scheme combined with FIR filters in a decision feedback 
architecture, its application to space-time CDMA systems and 
the development of a novel estimation technique. 

This paper is structured as follows. Section II briefly 
describes the space-time DS-CDMA communication system 
model and formulates the problem. The proposed space-time 
DF neural receivers for joint equalization and multiuser detec- 
tion are presented in Section III. The novel set-membership 
adaptive algorithms for the neural structures are presented in 
Section IV. Section V is dedicated to the presentation and 
discussion of the simulation results and Section VI gives the 
concluding remarks of this work and comments on possible 
extensions and future work. 



II. Space-time DS-CDMA system model and 

PROBLEM STATEMENT 



Let us consider the uplink of an asynchronous multi-code 
QPSK DS-CDMA system with L paths and 2 classes of 
K = Kh + Ki users, where Kh is the number of HD users 
with Nh chips per symbol and Ki the number of LD users 
with Ni chips per symbol, as depicted in Fig. 1. These QPSK 
symbols are spread with unique sequences for each user, 
modulated and transmitted over a communication channel. 
At the receiver equipped with antenna arrays, the composite 
signal is demodulated, sampled at chip rate and applied to a 
bank of filters matched to the users' effective spatial spreading 
sequences. In what follows, we mathematically detail these 
operations and describe the signal processing. 

The transmitted signal for the fcth user amongst the HD 
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users and the qth user amongst the LD users are given by 

oo 

**(*)= 4fe E b%(i)s h k (t-iT h ) 

(1) 

x l q (t)=A l q £ tf k (i)a l q (t-iTi) 

1— — OG 

where the equiprobable symbols b\(i) and b l q (i) £ 
{±1,±j'}/v2 with j 2 = —1 denote the i-th symbol for 
users k and q, the real valued spreading waveforms and the 
amplitude associated with HD user k and LD user q are s\(t), 
s l q (t), A\ and A q , respectively. 

The spreading waveforms for the HD and LD users 
are expressed by s\{£) = Yli^i a fe(*)0(^ — *^c) an d 
S L(*) = E« a i:(')^ — iT c ), respectively, where a%(i) £ 
{±l/ViVft"}, a' fc (i) € {±1/V3V/} . 0(0 is the chip waveform, 
T c is the chip duration and N h = T h /T c and N t = T ; /T c 
are the processing gains of the HD and LD users, respec- 
tively. In practice, HD users can use Nh — (3N, being 
(3 = 2,4,8,16,32, the smallest N = 8 and the largest 
N] r — 256 lf3Tl . Assuming that the receiver with linear antenna 
arrays is synchronized with the main path and identical fading 
is experienced by all antenna elements for each path of each 
user signal, the coherently demodulated composite received 
signal at the pth antenna element is 

»*(*) =EE hl m (t)e-'<-x h k (t ~ r>\ m 4) 

fc=l m=0 
Ki L-l 

g— 1 m=0 

(2) 

where ©fc,m = 27r b- sin(0£, m ) and 6^ m = 2tt(p- 

l)(d/A) sin(0^ m ) are the delay shifts of the mth path of the 
fcth HD user and the qth LD user, respectively. The quantities 
0fc m an( ^ 4>q m are me directions of arrival (DoA) of the 
signals of the HD user fc and LD user q and their rath paths, 
d = A/2 is the spacing between sensors and A is the carrier 
wavelength. The channel coefficients associated with the ra- 
th path and the fc-fh HD user are ti£ m (t), d^ € [0, N h ) 
is the delay of the fcth user taken from a discrete uniform 
random variable between and Nh and r^ m is the delay of 
the mth path of the fcth user, whereas the channel coefficients 
associated with the rn-th path and the q-th LD user are h l q m (t), 
d l q £ [0, Ni) is the delay of the qth user taken from a discrete 
uniform random variable between and N and r l q m is the 
delay of the mth path of the qth LD user. The coefficients 
of the channels for both LD and HD users are modelled as 
a tapped-delay-line and their coefficients can be computed 
according to an available standard such as the UMTS model 

ED. 

We assume that the delays are multiples of the chip periods, 
the channel is constant during each symbol interval (the 
coherence time of the channel does not exceed that of the 
symbol - this is typical in 3G systems [8|), the spreading codes 
are repeated from symbol to symbol, and the receiver with 
a J-element linear antenna array is perfectly synchronized 



with the main path. The complex envelope of the received 
waveforms after filtering by a chip-pulse matched filter and 
sampled at chip rate yields the discrete-time samples for the 
user of interest at the pth antenna element 

K h L-l 

k=l m=Q 
K t L-l 

+ E E K,m e ' j@ '"' mxl q ,m( nT c - T l q>m - d l q ) + n p (n) 

q—1 rn—0 

(3) 

By collecting these samples from each antenna element and 
organizing them into a JM x 1 observation vector correspond- 
ing to the ith signalling interval, we obtain 

K h K, 

r (0 = E xfc (*) + E z « (*) + »*(•) + n (*) w 

k=l q=l 

where M = Nh + L — 1, the fcth HD user signal is given by 
Xfc(i), the qth asynchronous signal of LD users that impinges 
on the antenna array is represented by z q (i). The (JM x 1) 
vector z q (i) is given by z q (i) — [z qt i(i) . . . z q ^ JM (i)] T , where 
z (i) — ^T L ~ 1 h l p~ j@l «,™-x l (nT-T l -d l ) 

The complex Gaussian noise vector is n(i) = 
[rii(i) ... njM{i)] T with zero mean and E\n(i)n H (i)] = 
cr 2 I, (-) T and (-) H denote transpose and Hermitian transpose, 
respectively, the operator E[-} stands for expected value, and 
rj(i) is the intersymbol interference (ISI) vector. Indeed, 
the JM x 1 vector of ISI Tf(i) is constructed with linear 
combinations of the channel coefficients, the symbols from 
previous, current and succeeding symbols and chips of the 
spreading codes of the HD users, similarly to z q (i). In this 
model, the ISI span and contribution r)[i] are functions of the 
processing gain Nh of HD users and the number of paths L. 
If 1 < L < Nh then 3 symbols would interfere in total, the 
current one, the previous and the successive symbols. In the 
case of Nh < L < 2Nh then 5 symbols would interfere, the 
current one, the 2 previous and the 2 successive ones. In most 
practical CDMA systems, we have that 1 < L < Nh and then 
only 3 symbols are usually affected. The reader is referred 
to H] for further details related to the ISI and models. The 
UMTS channel models BTI . which reveal that the channel 
usually affects at most 3 symbols (it typically spans a few 
chips) provides further details about the typical channels. In 
the case of HD users it is substantially more significant due 
to the ratio L/Nh which is greater than L/Ni. 

The main problem is how to detect the HD users with 
the minimum probability of error possible and at reasonably 
complexity. The minimum probability of error detector was 
proposed by Verdu J2], and corresponds to a combina- 
torial problem with exponential complexity with the number 
of users. In what follows, we seek the design of a high- 
performance detector with affordable complexity by proposing 
a decision feedback structure with recurrent neural networks. 
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III. Proposed Space-Time Decision Feedback 
Neural Receiver 

The space-time decision feedback receiver structure, de- 
picted in Fig. 2, applies a bank of ST-RAKE detectors ll25ll to 
the observation vector r(i), followed by a neural MUD that 
employs dynamically driven recurrent neural networks (RNNs) 
in the feedforward section for suppressing MAI and ISI and 
an FIR linear filter in the feedback section for cancelling the 
associated users in the system. The RNNs [9| used in the 
multiuser detector (MUD) are small structures, with feedback 
connections and where each artificial neuron is connected to 
the others, capable of achieving superior performance to MLPs 
and RBFs [9]. In this context, the advantage of the neural 
MUD over linear ones is the use of non-linear mappings to 
create decision boundaries for the detection of transmitted 
symbols. 

We consider a one shot approach (detection of the symbols 
corresponding to one time instant) where the Kh x 1 input 
vector u(i) = [ui(i) . . . UK h {i)] T to the MUD is given by 



u(i) = S ff r(i), 



(5) 



where S = [if ... s^-J and is the user k effective spatial 
signature of the fcth HD user that assumes the knowledge (or 
estimates) of the DOAs and channels or their estimates. The 
spatial signature s£ results from the convolution of the original 
signature sequence sjj with the channel and the time shifts 
resulting from the antenna array, and can be represented as 
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fc,l 



(6) 



where Sfc, p = [5k, P ,i . . . Sk, P ,M] is the M x 1 signature at 
the pth antenna of the receiver. 

Let us now describe in detail the schematic of the proposed 
space-time decision feedback neural receiver depicted in Fig. 
2. Specifically, the proposed receiver design is equivalent to 
processing the output u(i) of the bank of ST-RAKE detectors 
with the RNN in the feedforward part of the receiver and the 
decision feedback section. The Kh x 1 data vector u(i) is 
stacked with the Q x 1 vector of states of the RNN 



X fc (i-l) = p(Wf (!)€*(»)), 



(7) 



where the matrix W k (i) = [w M (i) ... w k ,j(i) ■■■ Wfc,g(i)] 
with dimensions (Q + Kh) X Q and whose Q columns Wkj(i), 
with j = 1, 2, . . . , Q have dimensions (Q + Kh) x 1 contain 
the coefficients of the RNN receiver for user k and ip(.) is the 
activation function of the RNN and which is used to form the 
(Q + K h ) x 1 vector 



\i-l) u T (i)} 



(8) 



which is processed by the neural part of the receiver. The 
activation function is chosen as (p(-) = tanh(-) in order to 
limit the amplitude range of the output signals of the network 
and to avoid numerical instability in the recurrent structure [?]. 
In particular, the tanh(-) fits well in communications problems 
as it provides naturally a soft estimate of symbols. In the case 
of DS-CDMA systems, it provides a soft estimate which is 
fed back to the neural structure and helps it to inversely map 



the symbol estimate appropriately in the presence of multiple- 
access interference and noise. The RNN in the feedforward 
section then processes £ fc («) to yield the initial decisions as 
given by 

b k (i) = w(Dxjk(i)) = Bgn(R[Dx fc (i)]) + jSgn(3[Dx fc (z)]) 

= B gn(»[Dp(Wf (<)&(»))]) +jsgn(Qr[D^(Wf (*)&(»))] 

(9) 

where v(-) denotes a slicing function that contains the op- 
erators 3?(.) and which are used to select the real and 
imaginary parts, respectively, and then quantizes the output 
of each operator in order to detect the QPSK symbols. The 
matrix D = [l ... 0] is the 1 x Q matrix that defines 
the number of outputs of the network and and sgn(.) is the 
signum function. Since we are only interested in one symbol at 
each time instant the vector D has this structure which means 
that only one output of the RNN is used to yield the symbol 
estimate. The initial symbol estimates for each user (including 
the HD users and the LD users which use an MMSE linear 
receiver j2j) are then stacked in order to form the K x 1 
vector with the initial decisions h(i) = [b\(i) 62 W • ■ ■ i>K(i)]. 
The estimate of the desired symbol for user k at symbol i then 
becomes: 



z fc (z) = D^(Wf(*)^(z))-ff (i)b(i) 



k = 1,2, 



■ ,K h 
(10) 

where the K x 1 vector (k = 1, 2, . . . , Kh) represents the 
feedback filters used to cancel the interference contribution of 
the associated users. In particular, the feedback filter ffc(i) of 
user k has a number of non-zero coefficients corresponding to 
the available number of feedback connections for each type of 
cancellation structure. The final detected QPSK symbol is: 



3gn(K 



Zk(i) 



jsgr 



n(Q z k (i) 



D V (W?(i)€ fc (i))-ff(i)b(i) 



-jsgn( 
(11) 



D V (wf(i; 



Regarding the structure of the decision feedback (DF) 
section, the literature reports two basic types of feedback, 
namely, the successive DF [7] and the parallel DF [27|. For 
successive DF (S-DF) ll26ll and a system with an equal number 
of feedback entries and users K, we have the K x K matrix 
F(i) — [fi(i) . • • iK h (i)\, which is formed with the filters for 
the K users, is strictly lower triangular, whereas for parallel 
DF (P-DF) ll27ll F(z) is full and constrained to have zeros 
on the main diagonal in order to avoid cancelling the desired 
symbols. The S-DF structure is optimal in the sense of that it 
achieves the sum capacity of the synchronous CDMA channel 
with AWGN l26l . In addition, the S-DF scheme is less affected 
by error propagation although it generally does not provide 
uniform performance over the user population, which is a 
desirable characteristic for uplink scenarios. In this context, the 
P-DF system can offer uniform performance over the users but 
is it suffers from error propagation in low signal-to-noise ratios 
(SNRs) and high bit error rates (BERs). In order to design 
the DF receivers and satisfy the constraints of the associated 
structures, the designer must obtain the vector with initial 
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decisions b(i) and then perform to the adopted cancellation 
approach. 

Unlike prior work, we focus on a scenario where there are 
Kh HD users, i.e., Kh feedback filters f k (i) with dimensions 
if xl as the filters are also supposed to cancel the contribution 
of the LD users. Thus, in this case the matrix of cancellation 
filters F(i) — [fi(i) fz(i) ■ ■ ■ iK h {i)] T has dimensions K x 
Kh- Due to its property of offering uniform performance over 
the user population and better performance as compared to 
S-DF schemes (27), |28], |29), we will adopt P-DF schemes 
and introduce a modification for including the LD users in 
the cancellation process. The feedback connections used and 
their associated number of non-zero filter coefficients in ffe 
are equal to K — 1 for the HD users. The resulting matrix 
F(i) for the proposed neural receiver is full and constrained 
to have zeros on the diagonal from the first row to the K^th 
row in order to avoid cancelling only the desired symbols of 
the HD users. The rest of the matrix is full such that all LD 
are addressed by the feedback section. 

IV. Set-Membership Adaptive Algorithms 

Set-membership filtering (SMF) [15], [16] represents a class 
of recursive estimation algorithms that, on the basis of a 
pre-determined error bound, seeks a set of parameters that 
yield bounded filter output errors. These algorithms have been 
used in a variety of applications such as adaptive equalization 
iTTBI and multi-access interference suppression [16|. The SMF 
algorithms are able to combat conflicting requirements of 
fast convergence and low misadjustment by introducing a 
modification on the objective function. In addition, these 
algorithms exhibit reduced complexity due to data-selective 
updates, which involve two steps: a) information evaluation 
and b) update of parameter estimates. If the filter update 
does not occur frequently and the information evaluation 
does not involve much computational complexity, the overall 
complexity can be significantly reduced. The adaptive SMF 
algorithms usually achieve good convergence and tracking 
performance due to an adaptive step size for each update, and 
reduced complexity resulting from data selective updating. 

In this section, a novel efficient parameter estimator for 
neural structures based on the SMF framework lfj"5l . [TTBI is 
described. Unlike prior work on FIR filters, we propose set- 
membership (SM) algorithms for RNNs structures. In the SM 
framework, the receiver weight vectors w k j(i) and f k (i) are 
designed to achieve a specified bound on the magnitude of the 
estimation error e k (i) = b k (i) — (Dxfe(i) — f^(i)b(i)). As a 
result of this constraint, the SM adaptive algorithm will only 
perform filter updates for certain data. Let Q k (i) represent the 
set containing all w k j(i) and f k (i) that yield an estimation 
error upper bounded in magnitude by 7. Thus, we can write 



sets ip k % = Dm=i ^ fc m P rov ided by the observations, where 
n k , m = {w w ,G C«+*"\ f fc G C K : \b k (m) ~ z k (m)\ < 
7}. In order to derive an SM adaptive algorithm using point 
estimates for the proposed DF receiver structure, we consider 
the cost function 

£(W fc (i),f fc (t)) = £;[|6 fc (i)-(Dx fc (»)-f i f(i)b(i))| a ] (13) 

subject to \e k (i)\ < 7, where e k (i) = b k (i) - (Dx fc (j) - 
fj^(i)b(i)). A gradient algorithm can be developed by com- 
puting the gradient terms with respect to w k j and ffc, and 
using their instantaneous values. Firstly, we consider the 
first partial derivative of C(W k (i),i k (i)) with respect to the 
parameter vector w k j(i) with dimension (Q + Kh) x 1, which 
forms the matrix Wfe 



a£(w fc (i),f fc (i)) 



9e k (i) \ * 
-T>A k>j (i)e* k (i 



(i) = -D 



dx fc (i) 



\et(i) 



(14) 



where the Kh x (Q + Kh) matrix A k j(i) contains the partial 
derivatives of the state vector x. k (i) with respect to w k j(i). 
To obtain the expressions for updating the matrix A k j(i), we 
consider the update equations for the state vector x k (i) given 
through (7) and (8). Using the chain rule of calculus in (8), we 
obtain a recursion that describes the dynamics of the learning 
process of the neural section of the receiver: 

A fcj (*+l) = * fc w(w fc :K WA fcJ W+U fcj (z)), j = l,...,K 

(15) 

where the Q x Q matrix ~W^ K denotes the submatrix of 
Wfe formed by the first Kh rows of Wfe, the Q x Q matrix 
for user k has a diagonal structure where the elements 
correspond to the partial derivative of the activation function 
ip(.) with respect to the argument wj^(i)£ fe (i) as given by 



* k (i) - diag(<p\w?M)t k (fi), ^(wf *(»)€*(»)) 



(16) 

where the K h x (Q + K h ) matrix U kJ (i) = [0 T T ] T 
has all the rows with zero elements, except for the j-th row that 
is equal to the vector £ k (i). By using a gradient optimization 
that uses the rules w fcj (i + 1) = w k ,j(i) - /^n ^^i^f) W) , 

f fe (i + l) = fk(i)-Vf 9C(W (£tyf k(i)) and substituting (14) and 
et(i)h(i), we obtain 



ac(w fc (i),f fc (i)) 



w k ,j (i + 1) = Wfc,j(i) +/inDAy(i)eJ(i) (17) 
f fc (t + 1) = f fc (i) - fi f b(i) (18) 



e*(*)= fl {wfe J ,eC«+ / ^), ffe eC 
(u(i),b fc (i))es 



K 



In order to ensure the constraint on the error bound (|efc(£)| < 
e k (i) \< 7} 7)> we substitute (17) into (13) and then we make the terms 
equal to 7. We proceed with the feedback filter i k (i) in the 
same way, which yields the algorithms 



(12) 



where S is the set of all possible data pairs (u(i),b k (i)) and 
the set 0fc(i) is referred to as the feasibility set and any point in 
it is a valid estimate z k (i). Since it is not practical to predict 
all data pairs, adaptive methods work with the membership 



w k ,j(i+l) 



w fcj(«) 



(1- 



if |efc(i)| > 7 

otherwise 
(19) 
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&(< + !) = ( ffeW - SS e «^ if |e fc «l>7 
[ ffe(i) otherwise 

(20) 

where ^„ = (1 - li ^)/||Afc ji (i)|| 2 and M/ = (1 - 

j^jjj^) / (h H (i)h(i)) are the new normalized step sizes of 
the algorithms. The novel adaptive technique is called the 
SM normalized real time recurrent learning (SM-NRTRL) 
algorithm. It introduces a powerful variable step size and 
discerning update rule. 

V. Simulation Experiments 

In this section we assess the performance of the proposed 
space-time neural receivers and SM algorithms in terms of the 
bit error rate (BER). We compare the proposed neural MUD 
(NMUD) and DF neural MUD (DF-NMUD) receivers with the 
RAKE (251, the linear MUD (L-MUD) and the DF-MUD with 
J = 1 and their space-time versions with J = 2, 3 antenna 
elements. We also evaluate the convergence performance of the 
adaptive estimation algorithms, i.e., the NLMS, the NRTRL 
0, the SM-NLMS fl3) and the proposed SM-NRTRL. For 
the NMUD we make ffc = in the structure and algorithms 
of Sections III and IV. 

The DS-CDMA system employs, QPSK modulation, Gold 
sequences of length N = 7 or random sequences of length 
N = 8 for the HD users and Gold sequences of length 
Ni = 31 for the LD users and the number of HD and LD 
users is equal (Ki — Kh) in all experiments. We adopted an 
identical ratio of LD to HD users for simplicity. In a practical 
system, it is expected that the number of LD users significantly 
exceed the HD ones since the LD users correspond to voice 
users (major part of users and market), whereas the HD users 
correspond to multimedia and data users. Another issue is that 
since a number of DS-CDMA systems in use include higher- 
order modulation schemes, besides QPSK, it should be noted 
that the proposed system can be modified to handle higher- 
order modulation schemes such as QAM- 16 and QAM-64. 
In this case, it would be necessary to modify the detection 
rule in the receiver. Specifically, the sheer would have to take 
into account the different levels of the signal constellation. 
The LD users also operate asynchronously with single-antenna 
MMSE linear receivers [25 1 and a power level 3 dB below 
that of HD users. The channels experienced by different users 
are i.i.d. whose coefficients for each user k (k — 1, . . . ,K) 
are hk,i(i) — Pi(*k,l(i)> where pi are the gains (or average 
powers) of the channel paths (power delay profile) and ak,i(i) 
(I = 0, 1, . . . , L — 1) are complex Gaussian random sequences 
obtained by passing complex white Gaussian noise through 
a filter with approximate transfer function c/yl — (f / fd) 2 
where c is a normalization constant, fd = v/X is the maximum 
Doppler shift, A is the wavelength of the carrier frequency, 
and v is the speed of the mobile [ 30 1 . The channel powers are 
normalized so that Yli=iPi = !• This procedure corresponds 
to the generation of independent sequences of correlated unit 
power Rayleigh random variables (£^[|o? = 1). We show 
the results in terms of the normalized Doppler frequency 
fdT (cycles/symbol) and use three-path channels with relative 
average powers pi given by 0, —3 and —6 dB, where in each 



run the spacing between paths is obtained from a discrete 
uniform random variable (d.u.r.v.) between 1 and 3 chips. 

The asynchronism Tk in chips is taken from a d.u.r.v be- 
tween 1 and the respective processing gain of the class of user 
(HD or LD). The parameters of the algorithms are optimized 
and the system has perfect power control between users of 
each class (HD or LD). The activation function <p(.) for the 
neural receivers is the hyperbolic tangent, the number of states 
Q of the RNNs is set to 1, the DOAs 4>k,m are uniformly 
distributed in (— 7r/3, 7r/3) for all simulations and the NLMS 
is used to estimate the effective spatial signature of the receiver 
structure. Experiments are averaged over 200 independent runs 
and the parameter 7 was chosen as V3.5<r 2 for the neural 
MUDs and V5a 2 for the other detectors. The proposed and 
existing algorithms are all initialized with parameter vectors 
with zero elements in order to provide a fair comparison. The 
BER performance shown in the results refers to the average 
BER amongst the Kh HD users. 

In Figs. 3 and 4 we show the BER convergence performance 
of the analyzed algorithms and linear and DF receivers, 
respectively. We also include the Kalman algorithm of [14j 
with J = 3 and the proposed neural structure in the compar- 
ison as an upper bound. The curves show that data-selective 
algorithms outperform the conventional adaptive techniques 
both in convergence speed and steady-state performance, while 
they only perform filter updates for a fraction of the overall 
symbols. The DF schemes, shown in Fig. 4, and in particular 
the DF-NMUD outperform the schemes without DF , shown in 
Fig. 3, and as the number of antenna elements J is increased 
so is the performance. The proposed SM-NRTRL algorithm 
for the neural structure outperforms the NRTRL and has a 
performance very close to the Kalman technique of 1041 . which 
has much higher complexity (O(^)). 

In terms of complexity, the algorithms require 0{QKh) for 
training the neural structures and 0{K) for training the linear 
filters but the SM-NRTRL neural algorithms exhibit better 
performance and lower UR. Indeed, the update rate (UR) of 
the SM algorithms is shown in Table I at E b /N = IQdB 
and fdT — 0.0001, where it is shown the advantage in terms 
of UR of the neural approach combined with SM framework. 
The UR depends on fdT and 7, and we verified that, for 
an extensive set of scenarios, the SM-NRTRL has the ability 
to consistently operate at lower UR than existing algorithms. 
The complexity is mainly dictated by the number of states Q 
of the neural network and the number of HD users Kh- The 
number of antenna elements J increases the complexity of the 
space-time receivers in an identical way for both neural and 
non-neural schemes because the in the proposed structure the 
signal processing either via neural structures or linear ones is 
performed after the bank of ST-RAKE receivers. In this case, 
we used NLMS algorithms to estimate the effective spatial 
signatures of the users, which despread the signals prior to 
the processing of the receiver. 

The BER performance of SM-adaptive algorithms and DF 
receivers is shown in Fig. 5. The receivers are adjusted 
with 200 symbols during the training period, then switch to 
decision-directed mode and process 2000 data symbols. The 
results show that the proposed DF-NMUD achieves the best 
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BER performance, followed by the DF-MUD with linear filters 
(DF-MUD), and the RAKE receivers. 

In the last experiment, shown in Fig. 6, we consider 
randomly generated sequences with N = 8 and receivers 
with single antennas and space-time receivers with J = 3 
antenna elements in order to evaluate a situation that may 
arise in a practical standard, i.e., the 3G UMTS system. In 
this case, the channels are modeled by the UMTS Vehicular 
A channel model ||3D with maximum Doppler frequency 
fxj = 100 Hz with a carrier frequency f c — 2 GHz. The 
receivers are adjusted with 200 symbols during the training 
period, then switch to decision-directed mode and process 
2000 data symbols. The curves show that the proposed DF- 
NMUD achieves the best BER performance, followed by the 
remaining techniques, corroborating the results obtained for 
the previous experiments. 

VI. Conclusions and Future Work 

A space-time adaptive DF neural receiver for high rate users 
in DS-CDMA systems was proposed for joint equalization 
and multiuser detection. In order to estimate the parameters 
of the proposed receiver, we developed a low complexity 
data-selective algorithm was developed based on the concept 
of Set-Membership filtering. Numerical results have shown 
that the proposed neural structure and algorithms considerably 
outperform existing methods. For future work, the proposed 
scheme and algorithms can also be considered and applied 
to other communications systems such as MIMO systems. 
Detectors with attractive trade-off between performance and 
complexity such as the proposed one are of central importance 
in MIMO systems. This would require a description of a 
discrete-time model of MIMO systems and a formulation of 
the problem involving the application of the proposed DF 
neural structure for extracting the desired signals embedded in 
interference. Future work may also involve the investigation of 
novel adaptive estimation algorithms using the set-membership 
framework. Algorithms with faster convergence rate than the 
one proposed here such as least squares and those with data 
reuse may be of interest for the estimation of parameters 
in a larger neural structure, as well as, the formulation of 
set-membership techniques with automatic adjustment of the 
bound. 
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(b) N=7, K h =3 users, E b /N =10 rJB, f d T=0.0001 
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Fig. 4. 

Captions: 

Fig. 1: Block diagram of the space-time DS-CDMA system 
model. 

Fig. 2: Block diagram of the proposed space-time decision 
feedback neural receiver. 

Fig. 3: BER convergence performance of data-seletive 
and conventional (dash-dotted lines) algorithms for linear 
receivers. 

Fig. 4: BER convergence performance of data-seletive and 
conventional (dash-dotted lines) algorithms for DF receivers. 

Table 1: Update Rate of SM algorithms at E b /N a = lOdB 
and f d T = 0.0001. 
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TABLE I 
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NMUD 


DF-MUD 


DF-NMUD 
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Fig. 6. 



Fig. 5: BER performance of DF receivers versus (a) E^/Nq 
and (b) number of HD users (Kh). 

Fig. 6: BER performance of DF receivers versus (a) E^/Nq 
and (b) number of HD users {K^) for a UMTS channel model. 



